Theory Based Approach
Describe the sampling distribution of a statistic and define the standard error.
Know when/why simulation and theory will yield different results.
So far, we’ve used a simulation-based approach to gather strength of evidence.
But…
Since we know the null distribution is normally distributed, we know the following:
If sample size is too small:
Theory-based “rule of thumb”:


Definition:
standard\: deviation\: of\: the\: null=\sqrt{\frac{\pi(1-\pi)}{n}}
Replace S.D. with theory-based calculation
z=\frac{Observed\: Statistic - Hypothesized\: Value}{Standard\: Deviation\: under\: the\: null}
=\frac{\hat{p}-\pi}{\sqrt{\frac{\pi(1-\pi)}{n}}}
We can still use the One Proportion Applet for the theory-based approach
In more advanced classes, we use calculus to get these values.
Let’s identify the following:
Sample & size
Variable of interest
Parameter & symbol
Observed statistic & symbol
Hypotheses (in symbols)
Do we meet the criteria for the theory-based approach?
| Scenario: |
|---|
| Researchers investigated whether children might be as tempted by toys as by candy for Halloween treats. Test households offered two plates to children: one with candy and one with a small toy. They observed the selections of 283 children that night and found that 148 of the kids chose candy. |
Sample & size: n = 283 kids
Variable of interest: Chose candy or not
Parameter & symbol: \pi = long run proportion of kids that chose candy
Observed statistic & symbol: \hat{p} = \frac{148}{283}
Hypotheses (in symbols):
Do we meet the criteria for the theory-based approach? Yes! More than 10 failures & successes.
Standard deviation using theory:
standard\: deviation\: of\: the\: null=\sqrt{\frac{\pi(1-\pi)}{n}}
=\sqrt{\frac{0.5(1-0.5)}{283}}=0.0297
Then, we use the calculated standard deviation to obtain the standardized statistic:
standardized\: statistic\: =\frac{0.523-0.5}{0.0297}=0.774
Now we find the standardized statistic with the simulation approach:
standardized\: statistic\: =\frac{0.523-0.5}{0.02}=0.793
Notice, we get nearly the same number!
Since our standardized statistic using either approach is not outside the twos, we conclude the following:
“We do not have enough evidence to conclude that the long-run proportion of kids who choose candy differs from 0.5.”
